
Domain decomposition and halo construction#540

Open
halungge wants to merge 304 commits into main from halo_construction

Conversation

@halungge
Contributor

@halungge halungge commented Sep 6, 2024

Decompose (global) grid file:

  • uses pymetis to decompose the global grid (cells) into n patches
  • after decomposition, halos for all dimensions (cell, edge, vertex) are constructed. Halo construction is done in an ICON-like fashion: halos consist of 2 cell levels (one upward- and one downward-pointing line of cells) and the corresponding vertices and edges on these lines.

Omissions:

  • LAM grids need to be investigated further:

    • tests comparing decomposed vs. single-node computation are only run on the global grid.
    • for LAM grids, ICON reorders arrays to arrange the halo points of the first boundary layers together with the boundary layers; it should be investigated whether that is essential in the model.
    • this PR only takes this into account in the computation of the start_index and end_index, not in the halo construction.
  • the number of halo lines (in terms of cells) is hardcoded to 2; that could be made a parameter.

  • It is not certain that everything runs correctly on GPU; most probably there are some numpy/cupy issues to fix.
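The two halo levels described above can be computed by repeated neighborhood expansion over the cell-to-cell connectivity. Below is a minimal, hypothetical sketch of that idea, not the PR's actual code: in practice the owned cells would come from pymetis.part_graph, and the edge/vertex halos would be derived from these cell lines.

```python
import numpy as np


def halo_lines(c2c: np.ndarray, owned: np.ndarray, num_lines: int = 2) -> list[np.ndarray]:
    """Compute halo cell lines by repeated neighborhood expansion.

    c2c:   (num_cells, k) cell-to-cell connectivity, -1 marking missing neighbors.
    owned: global indices of cells owned by this rank (from the partitioner).
    Returns one array of global cell indices per halo line.
    """
    known = set(int(c) for c in owned)
    frontier = set(known)
    lines = []
    for _ in range(num_lines):
        # All yet-unseen neighbors of the current frontier form the next halo line.
        next_line = set()
        for cell in frontier:
            for neighbor in c2c[cell]:
                if neighbor >= 0 and int(neighbor) not in known:
                    next_line.add(int(neighbor))
        lines.append(np.array(sorted(next_line), dtype=int))
        known |= next_line
        frontier = next_line
    return lines


# Toy example: a 1D chain of 5 cells, rank owns cells 0..2.
c2c = np.array([[-1, 1], [0, 2], [1, 3], [2, 4], [3, -1]])
print(halo_lines(c2c, np.array([0, 1, 2])))  # [array([3]), array([4])]
```

On a real triangular grid, the first expansion yields the first halo line and the second expansion the second line, matching the two hardcoded halo lines mentioned above.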

Magdalena Luz added 13 commits July 10, 2025 16:54
# Conflicts:
#	model/common/pyproject.toml
# Conflicts:
#	model/common/src/icon4py/model/common/grid/base.py
#	model/common/src/icon4py/model/common/grid/grid_manager.py
#	model/common/src/icon4py/model/common/grid/simple.py
#	model/common/tests/decomposition_tests/mpi_tests/test_mpi_decomposition.py
@halungge halungge force-pushed the halo_construction branch from df9c2ef to 72d4d4b on July 25, 2025 16:30
Magdalena Luz added 11 commits July 29, 2025 08:50
# Conflicts:
#	model/atmosphere/dycore/tests/dycore_tests/mpi_tests/conftest.py
#	model/atmosphere/subgrid_scale_physics/muphys/tests/muphys/fixtures.py
#	model/common/pyproject.toml
#	model/common/src/icon4py/model/common/decomposition/definitions.py
#	model/common/src/icon4py/model/common/grid/grid_manager.py
#	model/common/tests/common/decomposition/mpi_tests/test_mpi_decomposition.py
#	model/common/tests/common/decomposition/unit_tests/test_definitions.py
#	model/common/tests/common/grid/mpi_tests/test_parallel_icon.py
#	model/common/tests/common/io/unit_tests/test_io.py
#	model/common/tests/decomposition_tests/__init__.py
#	model/common/tests/decomposition_tests/mpi_tests/test_mpi_decomposition.py
#	model/common/tests/decomposition_tests/test_mpi_decomposition.py
#	model/testing/src/icon4py/model/testing/datatest_utils.py
#	model/testing/src/icon4py/model/testing/fixtures/datatest.py
#	model/testing/src/icon4py/model/testing/grid_utils.py
#	model/testing/src/icon4py/model/testing/parallel_helpers.py
Comment on lines +194 to +204
# TODO(msimberg): What should we do about this. (The global) num_cells is
# not guaranteed to be set here when used through fortran. Should we:
# 1. Ignore distributed?
# 2. Compute num_cells with a reduction?
# 3. Use a ProcessProperties to detect it?
distributed = (
    config.num_cells < global_properties.num_cells
    if global_properties.num_cells is not None
    else False
)
limited_area_or_distributed = config.limited_area or distributed

This would be good to resolve before this PR is merged.

Comment on lines +129 to +136
# TODO(msimberg): Is halo always expected to be populated?
global_indices_local_field = decomposition_info.global_index(
    dim,
    decomp_defs.DecompositionInfo.EntryType.OWNED,  # ALL if checking halos
)
local_indices_local_field = decomposition_info.local_index(
    dim,
    decomp_defs.DecompositionInfo.EntryType.OWNED,  # ALL if checking halos

This should still be fixed. I'm thinking of adding a check_halos parameter to the test. For fields that have the halo populated, check that the halo has been correctly populated.

Comment on lines +175 to +179
# TODO(msimberg): Is this true? Not true for RBF interpolation... why?
# We expect an exact match, since the starting point is the same (grid
# file) and we are doing the exact same computations in single rank and
# multi rank mode.
np.testing.assert_allclose(sorted_, global_reference_field, atol=1e-9, verbose=True)

Fix or remove todo before merging.

My first guess is that neighbors may be ordered differently on the local patch compared to the global grid, but can that actually happen? If it can, I can see the Cholesky decomposition leading to different results. If not, I would expect exact results.
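For context on why a reordered neighbor list alone could break exactness: floating-point addition is not associative, so summing the same values in a different order can change the low-order bits of the result. A generic illustration, unrelated to the actual RBF code:

```python
import numpy as np

rng = np.random.default_rng(42)
vals = rng.normal(size=1000).astype(np.float32)

# Accumulate the same values in two different orders in float32.
s_fwd = np.float32(0.0)
for v in vals:
    s_fwd += v

s_perm = np.float32(0.0)
for v in vals[rng.permutation(vals.size)]:
    s_perm += v

# The two sums agree to high precision but need not be bit-identical,
# which is why comparisons like assert_allclose use a tolerance.
diff = abs(float(s_fwd) - float(s_perm))
assert diff < 1e-3
print(diff)
```

If neighbor ordering does differ between the local patch and the global grid, the same effect would propagate through any reduction in the interpolation setup, making the atol in assert_allclose necessary rather than an exact comparison.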

    attrs_name: str,
    dim: gtx.Dimension,
) -> None:
    # TODO(msimberg): Currently segfaults. Are topography and vertical fields

@jcanton this is the test that segfaults. I've tried to set up a dummy vertical config and topography, but I may have misunderstood what needs to/can go in them. Does what I've added make sense or do you see any obvious mistakes? With embedded I get some non-segfault error messages that may be helpful to debug. I may just have set up the vertical levels incorrectly?


It seems like this was just a matter of having a consistent num_levels. The grid managers used a default of 10 while the configs manually constructed for this test used experiment.num_levels. So the original segfault is fixed.

However, I'd still appreciate knowledgeable eyes on whether the vertical config and topography that I added make sense for this test.

@msimberg
Contributor

cscs-ci run default

@msimberg
Contributor

cscs-ci run distributed

@nfarabullini nfarabullini left a comment

Mostly minor things for now as I understand that more work will be done.

Great job so far!!

Comment on lines +445 to +450
else:
    return IconLikeHaloConstructor(
        run_properties=run_properties,
        connectivities=connectivities,
        allocator=allocator,
    )

Suggested change
-else:
-    return IconLikeHaloConstructor(
-        run_properties=run_properties,
-        connectivities=connectivities,
-        allocator=allocator,
-    )
+return IconLikeHaloConstructor(
+    run_properties=run_properties,
+    connectivities=connectivities,
+    allocator=allocator,
+)

the else is not needed here


I slightly prefer the explicit else, but don't feel strongly enough to oppose removing it either. Do you prefer it without the else?


Generally I do, because it looks cleaner to me, but I also don't have particularly strong feelings.

# 3. Use a ProcessProperties to detect it?
distributed = (
    config.num_cells < global_properties.num_cells
    if global_properties.num_cells is not None

Suggested change
-    if global_properties.num_cells is not None
+    if global_properties.num_cells


This whole block is a TODO; ideally the None wouldn't even be possible. As with the other None, I still like the explicit comparison to None because 0 is falsy, though 0 would of course not be a useful value for num_cells.

Hopefully this whole thing is removed.

@github-actions

Mandatory Tests

Please make sure you run these tests via comment before you merge!

  • cscs-ci run default
  • cscs-ci run distributed

Optional Tests

To run benchmarks you can use:

  • cscs-ci run benchmark-bencher

To run tests and benchmarks with the DaCe backend you can use:

  • cscs-ci run dace

To run test levels ignored by the default test suite (mostly simple datatest for static fields computations) you can use:

  • cscs-ci run extra

For more detailed information please look at CI in the EXCLAIM universe.

